24 research outputs found
Recommended from our members
Quantitative Approaches to the Genomics of Clonal Evolution
Many problems in the biological sciences reduce to questions of genetic evolution. Entire classes of medical pathology, such as malignant neoplasia or infectious disease, can be viewed in the light of Darwinian competition of genomes. With the benefit of today's maturing sequencing technologies we can observe and quantify genetic evolution with nucleotide resolution. This provides a molecular view of genetic material that has adapted, or is in the process of adapting, to its local selection pressures. A series of problems will be discussed in this thesis, all involving the mathematical modeling of genomic data derived from clonally evolving populations. We use a variety of computational approaches to characterize over-represented features in the data, with the underlying hypothesis that we may be detecting fitness-conferring features of the biology.
In Part I we consider the cross-sectional sampling of human tumors via RNA-sequencing, and devise computational pipelines for detecting oncogenic gene fusions and oncovirus infections. Genomic translocation and oncovirus infection can each be a highly penetrant alteration in a tumor's evolutionary history, with famous examples of both populating the cancer biology literature. In order to exert a transforming influence over the host cell, gene fusions and viral genetic programs need to be expressed and thus can be detected via whole transcriptome sequencing of a malignant cell population. We describe our approaches to predicting oncogenic gene fusions (Chapter 2) and quantifying host-viral interactions (Chapter 3) in large panels of human tumor tissue. The alterations that we characterize prompt the larger question of how the genetics of tumors and viruses might vary in time, leading us to the study of serially sampled populations.
In Part II we consider longitudinal sampling of a clonally evolving population. Phylogenetic trees are the standard representation of a clonal process, an evolutionary picture as old as Darwin's voyages on the Beagle. Chapter 4 first reviews phylogenetic inference and then introduces a certain phylogenetic tree space that forms the starting point of our work on the topic. Specifically, Chapter 4 describes the construction of our projective tree space along with an explicit implementation for visualizing point clouds of rescaled trees. The Chapter finishes by defining a method for stable dimensionality reduction of large phylogenies, which is useful for analyzing long genomic time series. In Chapter 5 we consider medically relevant instances of clonal evolution and the longitudinal genetic data sets to which they give rise. We analyze data from (i) the sequencing of cancers along their therapeutic course, (ii) the passaging of a xenografted tumor through a mouse model, and (iii) the seasonal surveillance of H3N2 influenza's hemagglutinin segment. A novel approach to predicting influenza vaccine effectiveness is demonstrated using statistics of point clouds in tree spaces.
Our investigations into clonal processes may be extended beyond naturally occurring genomes. In Part III we focus on the directed clonal evolution of populations of synthetic RNAs in vitro. Analogous to the selection pressures exerted upon malignant cells or viral particles, these synthetic RNA genomes can be evolved against a desired fitness objective. We investigate fitness objectives related to reprogramming ribosomal translation. Chapter 6 identifies high fitness RNA pseudoknot geometries capable of inducing ribosomal frameshift, while Chapter 7 takes an unbiased approach to evolving sequence and structural elements that promote stop codon readthrough
Activating mutations and translocations in the guanine exchange factor VAV1 in peripheral T-cell lymphomas.
Peripheral T-cell lymphomas (PTCLs) are a heterogeneous group of non-Hodgkin lymphomas frequently associated with poor prognosis and for which genetic mechanisms of transformation remain incompletely understood. Using RNA sequencing and targeted sequencing, here we identify a recurrent in-frame deletion (VAV1 Δ778-786) generated by a focal deletion-driven alternative splicing mechanism as well as novel VAV1 gene fusions (VAV1-THAP4, VAV1-MYO1F, and VAV1-S100A7) in PTCL. Mechanistically these genetic lesions result in increased activation of VAV1 catalytic-dependent (MAPK, JNK) and non-catalytic-dependent (nuclear factor of activated T cells, NFAT) VAV1 effector pathways. These results support a driver oncogenic role for VAV1 signaling in the pathogenesis of PTCL
Inferring causal molecular networks: empirical assessment through a community-based effort
Inferring molecular networks is a central challenge in computational biology. However, it has remained unclear whether causal, rather than merely correlational, relationships can be effectively inferred in complex biological settings. Here we describe the HPN-DREAM network inference challenge that focused on learning causal influences in signaling networks. We used phosphoprotein data from cancer cell lines as well as in silico data from a nonlinear dynamical model. Using the phosphoprotein data, we scored more than 2,000 networks submitted by challenge participants. The networks spanned 32 biological contexts and were scored in terms of causal validity with respect to unseen interventional data. A number of approaches were effective and incorporating known biology was generally advantageous. Additional sub-challenges considered time-course prediction and visualization. Our results constitute the most comprehensive assessment of causal network inference in a mammalian setting carried out to date and suggest that learning causal relationships may be feasible in complex settings such as disease states. Furthermore, our scoring approach provides a practical way to empirically assess the causal validity of inferred molecular networks
Inferring causal molecular networks: empirical assessment through a community-based effort
It remains unclear whether causal, rather than merely correlational, relationships in molecular networks can be inferred in complex biological settings. Here we describe the HPN-DREAM network inference challenge, which focused on learning causal influences in signaling networks. We used phosphoprotein data from cancer cell lines as well as in silico data from a nonlinear dynamical model. Using the phosphoprotein data, we scored more than 2,000 networks submitted by challenge participants. The networks spanned 32 biological contexts and were scored in terms of causal validity with respect to unseen interventional data. A number of approaches were effective, and incorporating known biology was generally advantageous. Additional sub-challenges considered time-course prediction and visualization. Our results suggest that learning causal relationships may be feasible in complex settings such as disease states. Furthermore, our scoring approach provides a practical way to empirically assess inferred molecular networks in a causal sense
Neuralized1 activates CPEB3: a function for nonproteolytic ubiquitin in synaptic plasticity and memory storage
International audienceThe cytoplasmic polyadenylation element-binding protein 3 (CPEB3), a regulator of local protein synthesis, is the mouse homolog of ApCPEB, a functional prion protein in Aplysia. Here, we provide evidence that CPEB3 is activated by Neuralized1, an E3 ubiquitin ligase. In hippocampal cultures, CPEB3 activated by Neuralized1-mediated ubiquitination leads both to the growth of new dendritic spines and to an increase of the GluA1 and GluA2 subunits of AMPA receptors, two CPEB3 targets essential for synaptic plasticity. Conditional overexpression of Neuralized1 similarly increases GluA1 and GluA2 and the number of spines and functional synapses in the hippocampus and is reflected in enhanced hippocampal-dependent memory and synaptic plasticity. By contrast, inhibition of Neuralized1 reduces GluA1 and GluA2 levels and impairs hippocampal-dependent memory and synaptic plasticity. These results suggest a model whereby Neuralized1-dependent ubiquitination facilitates hippocampal plasticity and hippocampal-dependent memory storage by modulating the activity of CPEB3 and CPEB3-dependent protein synthesis and synapse formation
Pegasus: a comprehensive annotation and prediction tool for detection of driver gene fusions in cancer
BACKGROUND: The extraordinary success of imatinib in the treatment of BCR-ABL1 associated cancers underscores the need to identify novel functional gene fusions in cancer. RNA sequencing offers a genome-wide view of expressed transcripts, uncovering biologically functional gene fusions. Although several bioinformatics tools are already available for the detection of putative fusion transcripts, candidate event lists are plagued with non-functional read-through events, reverse transcriptase template switching events, incorrect mapping, and other systematic errors. Such lists lack any indication of oncogenic relevance, and they are too large for exhaustive experimental validation. RESULTS: We have designed and implemented a pipeline, Pegasus, for the annotation and prediction of biologically functional gene fusion candidates. Pegasus provides a common interface for various gene fusion detection tools, reconstruction of novel fusion proteins, reading-frame-aware annotation of preserved/lost functional domains, and data-driven classification of oncogenic potential. Pegasus dramatically streamlines the search for oncogenic gene fusions, bridging the gap between raw RNA-Seq data and a final, tractable list of candidates for experimental validation. CONCLUSION: We show the effectiveness of Pegasus in predicting new driver fusions in 176 RNA-Seq samples of glioblastoma multiforme (GBM) and 23 cases of anaplastic large cell lymphoma (ALCL). Contact: [email protected]
Pegasus: a comprehensive annotation and prediction tool for detection of driver gene fusions in cancer
Background
The extraordinary success of imatinib in the treatment of BCR-ABL1 associated cancers underscores the need to identify novel functional gene fusions in cancer. RNA sequencing offers a genome-wide view of expressed transcripts, uncovering biologically functional gene fusions. Although several bioinformatics tools are already available for the detection of putative fusion transcripts, candidate event lists are plagued with non-functional read-through events, reverse transcriptase template switching events, incorrect mapping, and other systematic errors. Such lists lack any indication of oncogenic relevance, and they are too large for exhaustive experimental validation.
Results
We have designed and implemented a pipeline, Pegasus, for the annotation and prediction of biologically functional gene fusion candidates. Pegasus provides a common interface for various gene fusion detection tools, reconstruction of novel fusion proteins, reading-frame-aware annotation of preserved/lost functional domains, and data-driven classification of oncogenic potential. Pegasus dramatically streamlines the search for oncogenic gene fusions, bridging the gap between raw RNA-Seq data and a final, tractable list of candidates for experimental validation.
Conclusion
We show the effectiveness of Pegasus in predicting new driver fusions in 176 RNA-Seq samples of glioblastoma multiforme (GBM) and 23 cases of anaplastic large cell lymphoma (ALCL). Contact: [email protected]
p53 maintains baseline expression of multiple tumor suppressor genes
TP53 is the most commonly mutated tumor suppressor gene and its mutation drives tumorigenesis. Using ChIP-seq for p53 in the absence of acute cell stress, we found that wild-type but not mutant p53 binds and activates numerous tumor suppressor genes, including PTEN, STK11(LKB1), miR-34a, KDM6A(UTX), FOXO1, PHLDA3, and TNFRSF10B through consensus binding sites in enhancers and promoters. Depletion of p53 reduced expression of these target genes, and analysis across 18 tumor types showed that mutation of TP53 associated with reduced expression of many of these genes. Regarding PTEN, p53 activated expression of a luciferase reporter gene containing the p53-consensus site in the PTEN enhancer, and homozygous deletion of this region in cells decreased PTEN expression and increased growth and transformation. These findings show that p53 maintains expression of a team of tumor suppressor genes that may together with the stress-induced targets mediate the ability of p53 to suppress cancer development. p53 mutations selected during tumor initiation and progression, thus, inactivate multiple tumor suppressor genes in parallel, which could account for the high frequency of p53 mutations in cancer